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Abstract 

For asexual organisms point mutations correspond to local displacements 
in the genotypic space, while other genotypic rearrangements represent long- 
range jumps. We investigate the spreading properties of an initially homoge- 
neous population in a flat fitness landscape, and the equilibrium properties on 
a smooth fitness landscape. We show that a small-world effect is present: even 
a small fraction of quenched long-range jumps makes the results indistinguish- 
able from those obtained by assuming all mutations equiprobable. Moreover, 
we find that the equilibrium distribution is a Boltzmann one, in which the 
fitness plays the role of an energy, and mutations that of a temperature. 

I. INTRODUCTION 

Darwinian evolution on asexual organisms acts by two mechanisms: mutations, that 
increase the genetic diversity, and selection, that fixes and reduces this diversity. 

We classify mutations into point ones, corresponding to local displacements on the geno- 
typic space (defined more accurately in the following), and other type of mutations or 
rearrangements, which imply larger jumps. Mutations are quite rare, so we can safely as- 
sume that for each generation at most only one mutation occurs. From a biological point 
of view, this discussion applies to unicellular asexual organisms, for which there is no dis- 
tinction between somatic and germ cells. Moreover, we do not consider the possibility of 
recombination (exchange of genetic material). For non-recombinant (asexual) organisms, 
the combined effects of reproduction and mutations correspond to a random walk on the 
genotypic space. Even sexual (recombinant) populations do transmit asexually some part 
of their genotype, such as mithocondrial or y-chromosome DNA, to which the following 
analysis applies. Furthermore, we assume a constant environment. 



We schematize the effects of selection by the selective reduction of the survival prob- 
ability. Selection results from several constraints and can affect the frequency of certain 
genotypes, or even eliminate them from the population. We classify the components of 
selection into static and dynamic ones. In the first class we put the constraints that are 
independent on the population distribution, such as the functionality of a certain protein or 
the tuning of a metabolic path. The static part of the selection is equivalent to the concept 
of "fitness landscape" . The dynamic part of the selection originates from to the com- 
petition among individuals. We assume in this schematization that the competition only 
arises because the population shares some external resource which is evenly distributed, i.e. 
we disregard the effects of the inter- individual competition @,f|. In this limit the effects of 
competition do not depend on the distribution of genotypes, and simply limit the total size 
of the population. 

The most important effects of the selection can be roughly schematized by assuming 
that certain genotypes are forbidden, so that they are eliminated from the accessible space. 
Let us consider for the moment that evolution takes place in a flat fitness landscape. In 
this frame, there are no interactions among individuals: the evolution is given by the simple 
superposition of all possible lineages. The probability distribution of the population in the 
genotypic space at a given time can be obtained by summing over all possible individual 
"histories" in a way similar to the path integral formulation of statistical mechanics ||. 
In this view, the phylogenetic lineage of an individual is given by the path connecting the 
genotypes of ancestors. 

This assumption has to fail somewhere, since otherwise evolution would be equivalent to 
a diffusion process, without anything favoring the formation of species. It often assumed that 
speciation events are rare and occur in a very short time ||, either due to some change in 
the fitness landscape (caused by an external catastrophes like the fall of a large meteorite, or 
by an internal rearrangement induced by coevolution) or because small isolated populations, 
escaping from competition, are free to explore their genotypic space (genetic drift) and find a 
path to an higher fitness peak 0. Therefore, it is usually assumed that one can reconstruct 
the diverging time of two species from their genotypic distance from a single speciation 
event §. 

It is however known from recent investigations about the small- world phenomenon II, 



that diffusion is seriously affected even by a small fraction of long-range jumps [ID], This 
fact could have dramatic consequences in our understanding of the large-scale evolution 
mechanism. If the short-range mutations are dominant, the evolution is equivalent to a 
diffusion process in the genotypic space. Assuming that the fitness landscape is formed by 
mountains separated by valleys, and that the crests of mountains are almost flat, then the 
large-scale evolution is dominated by the times needed to cross a valley by chance, while the 
short-scale is dominated by the neutral exploration of crests flBlJTll . 

Vice versa, if the long-range mutations are important (eventually amplified by the small- 
world mechanism), the time needed to connect two genotypes does not depend on their 
distance nor by the shape of the fitness landscape and the fitness maxima are quickly popu- 
lated. Moreover, in this scenario, the speciation phenomenon should not be ascribed to the 
"discovery" of a preexisting niche, but rather to the formation of that niche in a given ecosys- 
tem due to internal interactions (coevolution) or external physical changes (catastrophes). 
After the formation, the niche is quickly populated because of the long-range mutations. In 
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this framework, allopatric speciation 0] looses its fundamental importance, and sympatric 
speciation due to coevolution [12,13 becomes a plausible alternative. 

In order to evaluate quantitatively the relevance of these speculations, we introduce an 
individual-based model of an evolving population. We assume that the genetic information 
(the genotype) of an individual is represented by a binary string x = (xi,X2, ■ ■ ■ , xl) of L 
symbols Xi — 0,1 (multilocus model with two alleles). In this way we are modeling haploid 
organisms, i.e. bacteria or viruses or, more appropriately, more archaic, pre-biotic entities. 
The choice of a binary code is not fundamental but certainly makes things easier. It can be 
justified by thinking at a purine-pyrimidine coding, or to "good" and "bad" alleles for genes 
(units of genetic information). In this second version, represents a good gene and 1 a bad 
one. This is a sort of "minimal model" which is often used to model the evolution of genetic 
populations []E^-|lB|. For bacteria, one can give indications that major and minor codons 



work in such a way ||17|| . The genotypic space is thus a Boolean hypercube of L dimensions, 
and each individuals sits on a corner of this cube, according to its genotype. The 2 L corners 
of this hypercube represent all possible genotypes, which are at a maximum distance equal 
to L. 

In the next section, we formalize the model and introduce the mathematical representa- 
tion of mutation mechanisms. Then, in Section |TJ, we consider the consequences of short 
and long-range mutations for a flat fitness landscape. In the long-range case all genotypes 
are connected, regardless of their mutual distance: this case can be considered the equiva- 
lent of a mean-field approximation. We derive analytically the rate of spreading v and the 
characteristic spreading time r of a genetically homogeneous inoculum in the short-range 
(v s , t s ) and long-range (vi, re) cases. In the first case, v s is independent of the genotype 
length L, and t s grows linearly with L; the opposite happens in the long-range case. Clearly, 
this different behavior has dramatic evolutionary consequences as L becomes large. 

In general, however, only a small set of all possible long-range mutations is observed 
in real organisms. We thus compute numerically the rate of spreading v in a mixed short- 
range and sparse long-range case. A small-world effect can be observed: in the limit of 
large genotype lengths, a vanishing fraction of long-range mutations cooperates with the 
short-range ones to give essentially the mean-field results. 

In Section [IV], we show that the effects of mutations and selection can be separated 
in the limit of a very smooth fitness landscape and mean-field long-range mutations. In 
this approximation we obtain analytically that the asymptotic probability distribution is a 
Boltzmann equilibrium one, in which the fitness plays the role of of an energy and mutations 
correspond to temperature. We compute numerically the asymptotic probability distribution 
for several fitness landscapes, finding that the equilibrium hypothesis is verified. As a 
nontrivial example of small-world effects in evolution, we checked numerically that this 
scenario still holds for a sparse long-range mutation matrix. Conclusions and perspectives 
are drawn in the last section. 



II. THE MODEL 

In order to be specific, let us assume that the genotype of an individual is represented 
by a Boolean string of length L. In this way, the genotypic space is a Boolean hypercube 
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with 2 L nodes. 

The genotype of an individual is represented by a string x = (x±,X2, • • • , xl) of L Boolean 
symbols Xj = 0,1. Each position corresponds to a locus whose gene has two allelic forms. 
In this way we are modeling haploid (only one copy of each gene) organisms, i.e. bacteria or 
viruses or, more appropriately, more archaic, pre-biotic entities. The genotype can be also 
viewed as a spin configuration. In this case we use the symbol <7j = Ixi — 1. 

We consider two kinds of mutations: point mutations, that interchange a with a 1, 
and all other mutations that do not alter the length of the genotype (transposition and 
inversions). We define the distance d(x,y) between two genotypes x and y as the minimal 
number of point mutations needed to pass from x to y (Hamming distance) 

L 

d(x,y) = ^2(xi- yi ) 2 . 

i=i 

All possible genotypes of length L are distributed on the 2 L vertices of an hypercube. A 
point mutation corresponds to a unit displacement on that hypercube (short-range jump). 

The occurrence of point mutations in real organisms depends on the identity of the sym- 
bol and on its position on the genotype; in the present approximation, however, we assume 
that all point mutations are equally likely. Moreover, since the probability of observing a 
mutation is quite small, we impose that at most one mutation is possible in one generation. 
The probability to observe a point mutation from genotype y to genotype x is given by 
the short-range mutation matrix M s (x,y). By denoting with n s the probability of a point 
mutation per generation, we have 

{I- Us Hx = y, 
^ ifd(aj,y) = l, (1) 
otherwise. 

Other types of mutations correspond to long-range jumps in the genotypic space. The 
simplest approximation consists in assuming all mutations equiprobable. Let us denote with 
\ig the probability per generation of this kind of mutations. The long-range mutation matrix, 
Mg(x,y), is defined as 

| 1 - fi e if x = y, 
Mg(x, y) — < _jH — otherwise. ^ 
I 2 L - 1 

In the real world, only some kinds of mutations are actually observed. We model this fact 
by replacing M g with a sparse matrix Mg. We introduce the sparseness index s, which is 
the average number of nonzero off-diagonal elements of Mg. The sum of these off-diagonal 
elements still gives /ig. In this case Mg is a quenched sparse matrix, and Mg can be 
considered the average of the annealed version. 

After considering both types of mutations, the overall mutation matrix is M = M e M s 
or M = MgM s for the quenched version. 
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We model our population at the level of the probability distribution of genotypes p = 
pit), thus disregarding spatial effects. The evolution equation for p is 



A(x)J2 M(x,y)p(y) 
P[x) = —= , (3) 

where the selection function A(x) corresponds to the average reproduction rate of indi- 
viduals with genotype x, and A = J2 X A(x)p(x) is the average reproduction rate of the 
population HgPllDl. We write A ( x ) 

in an exponential form A(x) = exp(V(a;)), and we 



denote V(x) with the term fitness landscape. 

The selection does not act directly on the genotype, but rather on the phenotype (how 
an individual appears to others). The phenotype of a given genotype can be interpreted 
as an array of morphological characteristics. We consider the simplest case, in which the 
phenotype if univocally determined by the genotype x, which is not the general case, since 
polymorphism or age dependence are usually present. The general mapping between geno- 
type and phenotype is largely unknown and is expected to be quite complex. The effects of 
some genes are additive (non-epistatic), while others can interact in a simple (control genes) 
or complex (morphologic genes) way. 

A possible way of approximating these effects is to use the following form for the fitness 
V(x): 

V(x) = —^a i + j—^2a i a i+l + 1Cr l (x), (4) 

8=1 i=l 

where rj(x) is a random function of x, uniformly distributed between —1 and 1 ((rj(x)rj(y)) = 
&xy)- The "field" term "H represents a non-epistatic contribution to the fitness, in which all 
genes have equal weight. The "ferromagnetic" term J represents simple interactions between 
pairs of genes (even though in general those are not symmetric) and corresponds to a weakly 
rough landscape. Finally, fC modulates a widely rough landscape and can be thought as an 
approximation of the effects of complex (spin-glass like) interactions among genes. 



III. SPREADING AND SMALL- WORLD EFFECTS ON A FLAT FITNESS 

LANDSCAPE 

In this section we study the case of a flat fitness landscape (no selection), i.e. Ti = J = 
K, = 0. In this way we are modeling the evolution on the crest of a mountain, assuming that 
all deleterious mutations are immediately lethal (the part of genotype that can originate 
this kind of mutations is not considered), and that we can neglect the small variation of 
fitness along the crest. This landscape is the one usually considered in the theory of neutral 
evolution pT| . Let us assume that this crest is colonized by a founder deme genotypically 
homogeneous (a delta peak in the genotypic distribution). We want to obtain the average 
time needed to populate the crest according with the different mutation schemes. Since the 
fitness landscape is flat, A is a constant. 
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We are interested in the behavior of p(t), which is the average distance of the population 
from a given starting genotype Xq, i.e. 

p(t) = ^2p(x,t)d(x,x Q ). (5) 

X 

We introduce the spreading velocity v as 

dp 

In Appendix A we obtain the spectral properties of the mutational matrices M s and 
Me. The corresponding spreading velocities in the limit of small time interval t compared 
to the characteristic spreading time r s = L/2p s (short-range) and r e = 1/pi (long-range) 
are v s = p s , Eq. (|T^), and vi = Lp £ /2, Eq. (|T3|) . If short and long-range mutations coexists, 
one has v = v s + vi, Eq. flT4]). 

One can approximate the behavior of real biological systems by considering a mixture of 
short-range mutations, which occur with a relatively high frequency, and sparse long-range 
mutations, with sparseness index s. 

We investigated the sparse case by numerical simulations, for some genotype lengths 
L. As shown in Figure [l|, as soon as the sparseness index s > 0, the numerical value of v 
becomes very close to the mean-field one, Eq. ([14]). Notice that the average distance from 
inoculum, p(t), is rather insensitive to the distribution of p. The actual distribution can be 
quite different from the one obtained with the mean-field matrix M^. 

This transition may be interpreted as an indication of a small- world effect, and that 
there exists a first-order transition at s = [20|| . However, the fluctuations of the velocity, 
Var(f), appears to diverge at s = with an exponent 1, as shown in Figure 0. 

These results suggest the presence of a small-world phenomenon in evolution: the rare 
and sparse long-range mutations cooperate synergetically with the short-range ones to give 
essentially a mean- field effect. We checked that the previous results hold also for a non-flat 
(but smooth) fitness landscape (p s = 10~ 5 , pi = 10~ 4 and TL = 10~ 4 , J = 0, K, = 0). 
Again, as soon as s > 0, the spreading velocity increases from the short-range values to the 
mean-field ones. 



IV. EQUILIBRIUM PROPERTIES AND SMALL- WORLD EFFECTS ON A 

SMOOTH FITNESS LANDSCAPES 

In the following we shall study the effects of the cooperation between short-range and 
sparse long-range mutations on the equilibrium properties of the model. 

Let us study the model in the presence of a smooth static fitness landscape. In this case 
the fitness A does not depend explicitly on the distribution p, and Eq. @ can be linearized 
by using unnormalized variables z(x, t) satisfying 

z(x, t + l) = J2 A(y)M (x, y)z(y, t), (6) 
y 
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with the correspondence 

z(x, t) 



p(x,t) 



In vectorial terms, Eq. (||) can be written as 

z(t + 1) = MAz(t), (7) 

where M xy = M(x, y) and A xy = A(x)8 xy . 

When one takes into consideration only point mutations (M = M s ), Eq. @ can be read 
as the transfer matrix of a two-dimensional Ising model [pl| -p3[|, for which the genotypic 
element <Ji{t) corresponds to the spin in row t and column i, and z(er,t) is the restricted 
partition function of row t. The effective Hamiltonian V (up to constant terms) of a possible 
genealogical history {x(t)} or {er(t)} from time 1 < t < T is 

v = E ( iT.^ti + x ) + v ( x ®) ) > ( 8 ) 

t=i \ i=i / 

where 7 = - ln(/i s / (1 - /i s )). 

This peculiar two-dimensional Ising model has a long-range coupling along the row (de- 
pending on the choice of the fitness function) and a ferromagnetic coupling along the time 
direction (for small short range mutation probability). In order to obtain the statistical 
properties of the system one has to sum over all possible configurations (stories), eventu- 
ally selecting the right boundary conditions at time t — 1. The bulk properties of Eq. ([Bp 
cannot be reduced in general to the equilibrium distribution of a one dimensional system, 
since the transition probabilities among rows do not obey detailed balance. Moreover, the 
temperature-dependent Hamiltonian (f|) does not allow an easy identification between en- 
ergy and selection, and temperature and mutation, what is naively expected by the biological 
analogy with an adaptive walk. 



A. Long-range mutations 

Let us first consider the long-range mutation case. Eq. (0), reformulated according to 
Eq. ([D, corresponds to 

z(t + r) = (AM e ) T z(t). 

Since it is easier to consider the effects of A and Mi separately, let us study in which limit 
they commute. The norm of the commutator on the asymptotic probability distribution p 
is 

\\[AM e ]\\ = J2UM i } xy p(y)\, 

xy 

and it is bounded by /i^c, where c = max^ \ A XX — A yy \. In the limit /i^c — > (i.e. a very 
smooth landscape), to first order in c we have 

{AM £ ) T = A T M\ + C^t^A^M;- 1 , 
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which is the analogous of the Trotter product formula. 

When t is order M\ is a constant matrix with elements equal to 1/2 L , and thus 

MeP is a constant probability distribution. If \ii is large enough, (AMp) T = A T ~M.\. 

The asymptotic probability distribution p is thus proportional to the diagonal of A 1 ^ 1 : 

p(x) = Cexp^^T) (9) 

i.e. a Boltzmann distribution with Hamiltonian V(x) and temperature \xt. This corresponds 
to the naive analogy between evolution and equilibrium statistical mechanics. In other words, 
the genotypic distribution is equally populated if the phenotype is the same, regardless of 
the genotypic distance since we used long-range mutations. 

We have checked numerically this hypothesis, by iterating Eq. for a time T large 
enough to be sure of having reached the asymptotic state. We plotted the logarithm of the 
probability distribution p(x) versus the value of the Hamiltonian V(x). We computed the 
slope l//x e of the linear regression. The quantity \i e is the effective "temperature" of the 
probability distribution according to the equilibrium hypothesis. 

The results for the mean-field mutation matrix Mi are shown in Figure |3|. We see that 
the equilibrium hypothesis is well verified in the limit fie ^> c; and that convergence is faster 
for a rough landscape. In all cases the effective temperature /i e is close to the expected 
value, i.e. to the mutation rate m. 



B. Short-range mutations 

The above results only hold qualitatively for pure short-range mutations as shown in 
Figure A small dispersion of points in figure implies that the genotypes can be divided 
into evenly-populated groups sharing approximately the same fitness. This is always the 
case for the additive fitness landscape (Ti contribution), since in this case the position of 
symbols in the genotype is ininfluent, and the short-range mutations are able to homogenize 
the distribution inside a group. This is only approximately valid for the J contributions, 
since in this case the fitness depends on pairs of symbols, while mutations only act on 
single symbols. However, as shown in Figure §-d, the homogeneous state is reached for a 
sufficiently high mutation probability. Finally, the homogeneous state is never reached for 
the very rough landscape case (7i contribution), even though the linear relation is satisfied 
in average. 

In order to obtain a quantitatively correct prediction, one has to consider that the re- 
sulting slope /i e is related to the second largest eigenvalue Ai of the mutation matrix by 
fi = 1 — Ai. When the fitness only depends on the "external field" term 7i, in the asymp- 
totic state the short-range mutations connects group of equal fitness. This fact is reflected 
by the vanishing dispersion of points in Figure |4]-a and |]-b. 

In all cases, we expect that in the limit of a very smooth landscape, \x e tends towards 
the limiting value 2[i s jL of Eq. flTT|). This limit is reached faster when only the Tt term is 
present. This implies that for large genotype, the effective temperature due to short-range 
mutations is vanishing. 
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In the opposite case, when the selection is strong, the application of the matrix A 
"rotates" the distribution p in a way which is practically random with respect to the Fourier 
eigenvectors of M s . Thus, the effective second eigenvalue of the mutation matrix is given 
by 1 — fi s , obtained averaging over all the eigenvalues of Eq (p]) . Consequently, we obtain 
fi e ~ fi s . This limit is reached faster in the case of a strongly disordered fitness landscape, 
i.e. in the limit of large /C. When only the J term is present, one observes an intermediate 
case. 

C. Small- world effects 

Let us now consider the case of a sparse long-range mutation matrix, coupled to a stronger 
short-range matrix, i.e. fi s ^> fie- We performed simulations with fi s = 0.1 and fie = 0.01, 
L = 6, 8 and 10 and varying s. The results are summarized in Table |. 

The effects of the two kinds of mutations is additive, implying that, for small fie, the 
distributions are visually similar to the short-range case, Figure |j. However, as soon as s is 
greater than a threshold s c , the effective temperature fi e increases by the expected long-range 
contribution, fie. This increment is very relevant when the effect of short-range mutations 
are vanishing, i.e. for smooth landscapes (H and J contributions) and large genotypes. 
The effect is less relevant in the disordered case, (/C contributions), since in this case the 
contribution to the effective temperature by the small-range mutations do not decrease with 
L. 

By increasing the weigth of the sparse long-range mutations, also the probability distri- 
bution becomes similar to the long-range case, Figure |3|. We performed simulations with 
fi s = 0.1 and fie — 0.1, L = 6, 8, 10 varying s. The results are shown in Table Q. Notice that 
the value of s c (estimated visually from the plot of data) is rather insensitive to L and the 
parameters of the fitness V(x). 

We quantified the distance between the resulted distribution and the Boltzmann one 
Eq. (|D through the computation of the average square difference from linear regression, y 2 . 
In Figure |5] we show that \ 2 becomes very small as soon as s > s c , which seems to vanish 
with L — ► oo. 

This implies that, for large genotypes, a vanishing fraction of long-range mutations, 
coupled to small-range ones, are sufficient to establish the statistical mechanics analogy of 
selection and mutations. 

V. CONCLUSIONS 

We studied some simple models of asexual populations evolving on a smooth fitness 
landscape, in the presence of point mutations (small-range jumps in genotypic space) and 
other genetic rearrangements (long-range jumps). 

We have computed analitically the spreading velocity of an initially homogeneous in- 
oculum on a flat fitness landscape, for the short-range and the long-range mean-field (all 
mutations equiprobable) cases. Since in the real situation only a small set of the all pos- 
sible mutations can occur, we have also considered the quenched version of the long-range 
mutation matrix. In this case we have shown that a small- world effect is present, since 
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even a small fraction of quenched long-range jumps makes the results indistinguishable from 
those obtained by assuming all mutations equiprobable. These results still hold for a smooth 
fitness landscape. 

We have investigated this issue further, studying the equilibrium properties of the system 
in the presence of a smooth fitness landscape. In this framework, it has been possibile to 
show that the equilibrium distribution is a Boltzmann one, in which the fitness plays the role 
of an energy, and mutations that of a temperature. We have checked numerically this result 
for different fitness landscapes, and a mean-field long-range mutation mechanism. As in 
the previous case, a small-world phenomenon appears, since similar results can be obtained 
using a combination of sparse long-range and short-range mutations. 
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APPENDIX: SPECTRAL PROPERTIES OF THE MEAN-FIELD MUTATION 

MATRICES 

Both M s and Mp, Eqs. (|T|-0), are Markov matrices. Moreover, they are circular matrices, 
since the value of a given element does not depend on its absolute position but only on the 
distance from the diagonal. This means that their spectrum is real, and that the largest 
eigenvalue is Ao = 1. Since the matrices are irreducible, the corresponding eigenvector 
£ is non-degenerate, and corresponds to the flat distribution £o(x) = 1/2 L . In analogy 
with circular matrices in the usual space, one can obtain their complete spectrum using the 
analogous of Fourier transform in a Boolean hyper-cubic space. Let us define the "Boolean 
scalar product" 0: 



x y = (B x ^ 



i=l 



where the symbol © represent the sum modulus two (XOR) and the multiplication can be 
substituted by an AND (which has the same effect on Boolean quantities). This scalar 
product is obviously distributive with respect to the XOR: 

{x®y)Qz = (xQz)®(yQ z). 

Note that the operation x © y is performed bitwise between the two genotypes: {x@y)i = 

Xi® Hi- 

Given a function f(x) of a Boolean quantity x e {0, 1} L , its "Boolean Fourier transform" 

is f(k) (k e {0, 1} L ) 



2 

X 
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The anti-transformation operation is determined by the definition of the Kronecker delta 

\£C0fc 



X 



and is given by 

fix) = j2(-ir Bk m. 

k 

One can easily verify that the Fourier vectors £&(a;) = (— l) xQk are eigenvectors of both 
Mi and M s , with eigenvalues 

A = 1, 

\ i ^ ( 10 ) 

A fc = l-fi e - 7^—[i 

for the long-range case, and 

Ac =1, 

2/i s rf(fc,0) (11) 
A fc -1 , 

for the short-range case, where d(x, y) is the Hamming distance between genotypes x and 

y- 

The computation of p(t), Eq. ([5]), is easily performed in Fourier space, using the analogous 
of Parsifal theorem: 

E f{x) 9 (x) = e E E (-ir 0(feffifc,) 

£C £C fc fc' 

= 2* 

fc 

where we have used the property fc © fc' = if and only if ki = k\ for each component i. 

Let us denote by e n the unit vector along direction n, i.e. (e n )j = 5 ni . 

The Fourier transform of the distance d(x,y) is obtained considering that (— iy x ®v) &e ™ 
gives 1 if x n — y n and —1 otherwise, thus 



d(*,y) = iU-|>l) 

V n=0 



(a;®y)0e„ 
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We obtain 



d X0 (k) = ±J2 d ( x > x °)(-V X ° k 



1 1 



L-l 



^E(^-E(-i) ( 



n=Q 



2 dk0 ~ 2 



\n=0 / \ a; 

1 L_1 

*»-^£(-ir 0e "<W; 



n=0 



i.e. 



'L/2 


if fc = 0, 


,-1/2 


if fc = e n 


1/2 


if fc = e n 




otherwise 



d X0 (k) = < 



The probability distribution p(x,t) can be expanded on the eigenvector basis £k{ x ) of 



M: 



and 



i.e. 



Thus 



p(a;,0) = £ = ^ a h (-l)»® h 

k k 

p(x,t) = M*p(aj,0) = £a fc Aj£ fc (a:) 

p(fc,t) = a fc A' fc . 
p(t) = ^2d X0 (k)p(k,t) 

k 

= ^2d X0 (k)a k X t k 
k 

L L ~ l 1 



2 u ^ 2 

n=0 



If at t = the distribution is concentrated at Xq = (p(cc, 0) = S x o) then — 1 for all 
fc. In both the short and long-range cases, A en does not depend on n (see Eqs. (|10D-(|TTD), 
and thus 



P(t) = 2 ( A o " Ai) 
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For the short-range case we have 



and the characteristic spreading time is r s = L/2p s . For t small compared with r s (L — > oo) 
we have 

p s (£) ~ p s t = v s t. (12) 
For long-range mutations we have 

^(-H-iib)') 

with r| = l//i£. For t << (p^ — > 0) we have 

Pi (t) ~ = u,t. (13) 

The behavior of p(t) for short times, vanishing mutation probabilities and large geno- 
types, Eqs. (|I~2"|) -(|T3l), is rather trivial. In these approximations one can neglect back mu- 
tations, and obtain the same results on a Cayley tree. However, the full analysis gives the 
exact behavior of p(t) for all times. 

In the mixed case one has M = M s Me. Since M s and Mi share the same eigenvectors, 
A = A S A^ and, in the previous approximations, 

v = v s + v e . (14) 
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FIGURES 




FIG. 1. The spreading velocity v vs. sparseness s for three values of the genotype length 
L; H£ = 10~ 4 , n s = 10 -3 , flat fitness landscape. The dashed lines indicate the mean field value 
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S 

FIG. 2. The fluctuations Var(u) of the spreading velocity v vs. sparseness s for for three values 
of the genotype length L; fi£ = 10~ 4 , fi s = 10~ 3 , flat fitness landscape. The dashed line represent 
the law Var(u) ~ s -1 / 2 . 
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FIG. 3. Numerical check for long-range mutations. In the simulations we set L = 8, m = 0.1 
and fi s = 0. We varied TL (a,b), J (c,d) and fC (e,f), setting all other parameters to zero, (a) 
H = 0.01, J = 0, /C = 0; (b) U = 0.001, J = 0, K, = 0; (c) H = 0, J = 0.01, K. = 0; (d) H = 0, 
J = 0.001, K = 0; (e) H = 0, J = 0, K = 0.1; (f) H = 0, J = 0, JC = 0.01. In the figures t 
indicates the number of generations, \i e the reciprocal of the slope of linear regression. 
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FIG. 4. Numerical check for short-range mutations. In the simulations we set L = 8, m = 
and fi s = 0.1. We varied Ti (a,b), J (c,d) and K, (e,f), setting all other parameters to zero, (a) 
U = 0.1, J = 0, tC = 0; (b) H = 0.01, J = 0, K = 0; (c) H = 0, J = 0.01,/C = 0; (d) H = 0, 
J = 0.001, K = 0; (e) H = 0, J = 0, K, = 0.01; (f) H = 0, J = 0, K = 0.001. In the figures t 
indicates the number of generations, ji e the reciprocal of the slope of linear regression. 
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FIG. 5. Scaling of the average square difference from linear regression, x 2 vs - sparseness s, for 
three values of the genotype length L; = 0.1, fj, s = 0.1, H = 0, J = 0, K = 0.1. Averages taken 
over 20 runs. 
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TABLES 
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L = 6 


L = 8 


L = 10 


L = 12 




He 


Me 


Sc 




Me 


Sc 


u (0) 

fJ-e 


Me 






Me 


Sc 
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0.035 


0.045 


3 


0.025 


0.035 


3 


0.020 


0.030 


2 


0.017 


0.027 


2 


j 


0.071 


0.082 
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0.053 


0.063 
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0.045 


3 


JC 


0.080 


0.105 


6 


0.090 


0.105 
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0.096 


0.107 


6 


0.098 


0.111 


3 



TABLE I. Effettive temperature \i e for several values of the genotype length L and parameters 
of the fitness V(x), Eq (£|). Here {i s = 0.1, m = 0.01; the row labeled H stands for H = 0.1, 
J = K, = 0, the row labeled J stands for J = 0.1, H = K, = 0, the row labeled K, stands for 
K, = 0.1, H = J = 0. We report the value or /i e ^ for sparseness s = (only small-range mutations) 
and the average value 7i e for s > s c , where s c is estimated visually from the plot of data (see also 
Fig. |). 
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